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Computing zeta functions of nondegenerate hypersurfaces 

with few monomials 



Steven Sperber and John Voight 

Abstract 

Using the cohomology theory of Dwork, as developed by Adolphson and Sperber, we exhibit a 
deterministic algorithm to compute the zeta function of a nondegenerate hypersurface defined 
over a finite field. This algorithm is particularly well-suited to work with polynomials in small 
characteristic that have few monomials (relative to their dimension). Our method covers toric, 
affine, and projective hypersurfaces and also can be used to compute the L-function of an 
exponential sum. 

Let p be prime and let be a finite field with q = p"" elements. Let y be a variety defined 
over ¥q, described by the vanishing of a finite set of polynomial equations with coefficients 
in ¥q. We encode the number of points ^V{¥qr) on V over the extensions F^r of Fg in an 
exponential generating series, called the zeta function of V: 

Z(y,T)=exp(^#y(F,0 — j el + TZ[[T]]. 

The zeta function Z(V, T) is a rational function in T, a fact first proved using p-adic methods 
by Dwork [161 117j . The algorithmic problem of computing Z{V,T) efficiently is of significant 
foundational interest, owing to many practical and theoretical applications (see e.g. Wan |58j 
for a discussion). 

From a modern point of view, we consider Z(V,T) cohomologically: we build a p-adic 
cohomology theory that functorially associates to V certain vector spaces W over a p-adic 
field K, each equipped with a (semi-)linear operator Frobi, such that Z{V,T) is given by an 
alternating product of the characteristic polynomials of Frob^ acting on the spaces W. The 
theory of £-adic etale cohomology, for example, was used by Deligne to show that Z{V,T) 
satisfies a Riemann hypothesis when V is smooth and projective. Parallel developments have 
followed in the p-adic (de Rham) framework, including the theories of Monsky-Washnitzer, 
crystalline, and rigid cohomology (see Kedlaya |35] for an introduction). In this paper, for a 
toric hypersurface V defined by a (nondegenerate) Laurent polynomial / in n variables over 
¥q, we employ the cohomology theory of Dwork, working with a space ff"+^(r2*) obtained as 
the quotient of a p-adic power series ring over K in n + 1 variables by the subspace generated 
by the images of rt + 1 differential operators. 

Efforts to make these cohomology theories computationally effective have been extensive. 
Schoof's algorithm for counting points on an elliptic curve |54j (generalized by Edixhoven 
and his coauthors |22j to compute coefficients of modular forms) can be viewed in this light, 
using the theory of mod £ etale cohomology. A number of results on the p-adic side have also 
emerged in recent years. In early work. Wan |59j and Lauder and Wan [4T demonstrated 
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that the p-adic methods of Dwork can be used to efhciently compute zcta functions in small 
(fixed) characteristic. Lauder and Wan use the Dwork trace formula and calculate the trace 
of Frobenius acting on a p-adic Banach space, following the original method of Dwork and 
working on the "chain level" . In this paper, we instead work with the extension of Dwork's 
theory due to Adolphson and Sperber 3^ ; this point of view was also pursued computationally 
by Lauder and Wan in the special case of Artin-Schreier curves [481 149j . Under the hypothesis 
that the Laurent polynomial / is nondegenerate (see below for the precise definition) , the zeta 
function can be recovered from the action of Frobenius on a certain single cohomology space 
_ff"+i(f2). This method works with exponential sums and so extends naturally to the case of 
toric, affine, or projective hypersurfaces '4'. (It suffices to consider the case of hypersurfaces 
to compute the zeta function of any variety defined over a finite field using inclusion-exclusion 
or the Cay ley trick.) 

The method of Dwork takes into account the terms that actually occur in the Laurent 
polynomial /; these methods are especially well-suited when the monomial support of / 
is small, so that certain combinatorial aspects are simple. This condition that / have few 
monomials in its support, in which case we say (loosely) that / is fewnomial (a term 
coined by Kouchnirenko |42| ). is a natural one to consider. For example, many explicit 
families of hypersurfaces of interest, including the well-studied (projective) Dwork family 

Xq^^ H -I- a;""*"^ -I- XxqXi • • • a;„ = of Calabi-Yau hypersurfaces jl8j (as well as more general 

monomial deformations of Fermat hypersurfaces [19]) can be written with few monomials. 
In cryptographic applications, the condition of fewnomialness also often arises. Finally, the 
running time of algorithms on fewnomial input are interesting to study from the point of view 
of complexity theory: see, for example, work of Bates, Bihan, and Sottile [5]. 

To introduce our result precisely, we now set some notation. Let y be a toric hypersurface, 
the closed subset of G"j defined by the vanishing of a Laurent polynomial 

7= ^ a,x'^eFjx±]=FJxf,...,x±]. 

We use multi-index notation, so = x\^ ■ ■ ■ x^ . We sometimes write Z{f,T) — Z{V,T). Let 
A — A(/) be the Newton polytope of /, the convex hull of its support 

supp(7) = {i/ e Z" ; 7^ 0} 

in M". For simplicity, we assume throughout that dim(A) n. For a face r C A, let f\r = 
X^i/gr O'lyx'^ ■ Then we say / is (/S.-)nondegeneiate if for all faces r C A (including A itself), the 
system of equations 

j\t= xi— — = • • • = a;„- — = 

aXi OXn 

has no solution in F^", where is an algebraic closure of F^. The set of A-nondegenerate poly- 
nomials with respect to a polytope A forms an open subset in the affine space parameterizing 
their coefficients {a^),^^AnZ"- under mild hypothesis, such as when A contains a unimodular 
simplex, then this subset is Zariski dense. (See Batyrev and Cox [6] as a reference for this 
notion as well as the work of Castryck and the second author for a detailed analysis of 
nondegenerate curves.) We distinguish here between A(/) and Aoo(/) which is the convex 
closure of A(/) U {0}: for the Laurent polynomial wf in n -f 1 variables, / is A-nondegenerate 
if and only if wf is nondegenerate with respect to Aoo(/) in the sense of Kouchnirenko |41| . 
Adophson and Sperber [3], and others. Nondegenerate hypersurfaces are an attractive class to 
consider because many of their geometric properties can be deduced from the combinatorics 
of their Newton polytopes. 

Let s = # supp(/) and let U be the (n + 1) x s-matrix with entries in Z whose columns are 
the vectors (1, i^) G Z"+^ for E supp(/). Let p be the rank of U modulo p. Let v = Vol(A) — 
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n!vol(A) be the normalized volume of A, so that a unit hypercube [0,1]" has normalized 
volume nl and the unit simplex a — {{ai, . . . , a„) S M"q : < 1} has normalized volume 

1. 

We say that A is confined if A is contained in an orthotope (box) with side lengths &i, . . . , 6„ 
with hi - ■ - bn < n^v. We say that / is confined if A(/) is confined. A slight extension of a 
theorem of Lagarias and Ziegler |43) shows that every polytope A is GL„(Z)-equivalent to a 
confined polytope; this existence can also be made effective. (See section 3 for more detail.) 
In other words, for each Laurent polynomial / there is a computable monomial change of 
basis of Fq[a;*], giving rise to an equality of zeta functions, under which / is confined. (In the 
theorem below, at the expense of introducing a factor of log (5, where 5 — 6(S) — ma,x^^s W\ 
where \i>\ = max^ one can remove the assumption that A is confined.) 

For functions f,g : Z™q — > M>o, we say that / — 0{g) if there exists c e M>o and N E Z>o 
such that for every x — {xi, . . . ,Xm) € ^>Ar we have g{x) < cf{x). (The reader is warned 
that not all properties familiar to big-Oh notation for functions of one variable extend to 
the multivariable case; see Howell |29j . In fact, our analysis also holds with Howell's more 
restrictive definition, but we will not pursue this further here.) We further use the "soft-Oh" 
notation, where / = 0{g) if / = 0{g\og'^ g) for some k>l. 

Our main result is as follows. 

Theorem A. Let n E Z>i. Then there exists an explicit algorithm that, on input a 
nondegenerate Laurent polynomial f E ¥q[xf , . . . ,x^] with p>3 and an integer > 1, 
computes as output Z{f,T) modulo . If further / is confined, then this algorithm uses 

0(s r"/2l pjsjs jQgg ^ p^-P{QN + nYiv'^Nlog^ q)) 

bit operations. 

To recover the zeta function (as an element of Q{T)), if we fix both the dimension and the 
number s of monomials, we have the following result. 

Theorem B. Let n E Z>i and s E Z>„. Then there exists an explicit algorithm that, 
on input a confined, nondegenerate Laurent polynomial / E ¥q[x^ , . . . , x^] with p > 5 and 
s ~ #supp(/), computes as output Z{f,T) using 

bit operations. 

According to a theorem of Adolphson and Sperber [3], under the hypothesis that / is 
nondegenerate, Z(/, qr)^"^^" is a polynomial of degree v times Z(G"j, T)(~^)". Therefore, 
in the context of Theorem B, if p = Oivlogq) is small (or fixed), then our algorithm runs in 
polynomial time in the (dense) output size, which is the best one could hope for (aside from 
minimizing the degree of this polynomial). (The fewnomial input size, on the other hand, is 
O(slogwlog(7) for / confined and n fixed.) 

In our theorem, we require the dimension n to be fixed for several reasons. First, we employ 
well-known algorithms for lattice polytopes which have only been analyzed assuming that the 
dimension is fixed. (They further assume that arithmetic operations in Z take time 0(1), which 
is nearly valid in the usual bit-complexity model if A is confined and n is fixed; for a discussion 
of this point, see Section 3.) Second, it is often quite natural from a geometric point of view 
to consider the dimension to be fixed; one often considers families of hypersurfaces of a fixed 
dimension, for example. Finally, allowing fewnomial input and output and varying dimension, 
the problem of computing Z{f, T) is harder than the NP-complete problem 3-SAT (indeed, for 
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the latter one only wishes to know if #X(F2) > for an afhne hypersurface X of degree 2). 
For these reasons, we restrict our analysis (continuing below) to fixed dimension. 

Our method follows in the same vein as other recently introduced p-adic cohomological 
techniques. The methods of Lauder and Wan |47j mentioned above compute Z{f,T) for 
a polynomial / of total degree d using 0(p2»+4d3»'+9" log^"+'^ g) = 0((pu log g)3"+9) bit 
operations. The dense input size of / is 0{{d + l)"logq); consequently if the prime p (and 
dimension n) are fixed then their algorithm runs in polynomial time in the dense input size. 
Their method, although apparently not practical, is completely general and does not require 
any hypothesis on /. Our method can be analyzed on dense input (see Section 5) as well, 
running in time 0(p^"w^"~''* log^""*"^) with no condition on the number of monomials in the 
support of /. 

In a different direction, Kedlaya |34j (see also the presentation by Edixhoven 121] ') used 
Monsky-Washnitzer cohomology to compute the zeta function of a hyperelliptic curve of genus 
g over Fg in time 0{pg^log^ q). (Note here that the dense input size is 0{glogq).) This idea 
has been taken up by several others: see, for example, work of Abbott, Kedlaya, and Roe [1], 
who compute the zeta function of a projective hypersurface by working in the complement 
of the hypersurface and using Mumford reduction. (Indeed, Kedlaya has suggested that there 
should be a natural extension 37, of his ideas to the realm of toric hypersurfaces.) Our method 
also mirrors the algorithm of Castryck, Denef, and Vercauteren [101 . who tackle the case of 
nondegenerate curves. Their method has good asymptotic behavior but to be practical needs 
an optimized implementation |9l §1.2.4]. However, rather than following this vein and working 
with Monsky-Washnitzer (p-adic de Rham) cohomology, we employ the cohomology theory of 
Dwork, which has a more combinatorial flavor. 

In a yet further direction, Lauder has used Dwork's theory of p-adic differential equations 
to compute zeta functions using deformation |44l and recursion |45| . The Frobenius, acting 
on the members of a one-parameter family, satisfies a differential equation coming from the 
Gauss-Manin connection. Lauder uses this equation to solve for the action given an initial 
condition arising from the action on the cohomology of a simple variety which one can compute 
directly. Our method fits into this framework as it provides natural base varieties to deform 
from: indeed, the idea of deformation in the context of nondegenerate curves has been pursued 
by Tuitman [571. The methods of Lauder show that one can compute Z{V,T) for a smooth 
projective hypersurface V CP" of degree d with p \ d and nonvanishing diagonal terms in time 
p^((i" log g)'^^^-' . The deformation method has also been pursued fruitfully by Gerkmann [25) 
and others in different contexts. 

Adapting an idea of Chudnovsky and Chudnovsky and Bostan, Gaudry, and Schost 7 for 
accelerated reduction, Harvey [27j has improved Kedlaya's method for hyperelliptic curves, 
with a runtime of p^^^{glogq)^^^\ This approach appears to extend to higher dimensions as 
well, extending the method of Abbott, Kedlaya, and Roe |28j : his method appears to give a 
runtime of p^/^(d" log (7)"+'^*^^^ under a smoothness hypothesis analogous to the condition of 
nondegeneracy (but somewhat weaker). It would be interesting to see how his ideas for lowering 
the exponent on p might apply in our situation. 

This paper is organized as follows. In section 1, we introduce the cohomology theory of 
Dwork and give an overview of our method. In section 2, we discuss each step of the algorithm 
in turn: computing the splitting function and the Jacobian ring, the computation of Frobenius 
(where the condition of sparsity enters), and the reduction theory in cohomology. In section 
3, we give some algorithms for computing with polytopes. We then discuss running time and 
precision estimates for the complete algorithm in section 4. Finally, in section 5 we discuss the 
case p — 2 and consider some other possible modifications. We conclude in section 6 with some 
examples. 
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1. Overview 

In this section, we give an overview of our algorithm. Our introduction wiU be concise; for a 
more complete treatment of the theory of Dwork 17J, see Koblitz .40j . Lauder and Wan |47j . 
and Monsky [52] , 

In this section, we assume p > 2; see section 5 for a discussion of the case p = 2. 



Exponential sums 

Let / e Fg[a;j'^, . . . , x^] be a Laurent polynomial and let V C (Gm)]J be the toric hypersur- 
face defined by the vanishing of /. Let 9 : Fg C be a nontrivial additive character (with C 
a commutative ring of characteristic zero), so that 

e{x + y)=e{x)Qiy) 



for all x,y GVq. A point of departure for the theory of Dwork is the observation that for 

e 

J2 (^iw7m = 



x G F^", we have 



w£¥„ 



q, if fix) = 0; 
0, otherwise. 



Consequently 

g#F(F,)= J2 0(^7(^))= E e{w7ix)) + iq-ir. (1.1) 

In other words, counting the set of points V{¥q) can be achieved by instead evaluating an 
exponential sum (on either x G"^ or G"^^). 

For r G Z>i, we define a system of nontrivial additive characters 0,- : F^r — > C by 8^ = 
Q o Tr^ where Tr^ : ¥qr — > F^ is the trace map, and we define the exponential sums 

Sriwf,G:+')= J2 ^riwJix)). 

The L-function associated to wf over G'^'^ is defined to be 

L{w7,G^:^\T) ^ exp (^g Sriw7,G-t') ^,:^ 

Then by (11.11) we have 

Z(V, qT) - L{wJ, r)Z((G:^)F, , T), (1.2) 

where 

n 

^(raF„r)(-i)""^ ^ -Q(i _ q^TpY-^y. (L3) 

i=0 



The Dwork splitting function and interpolation 

Complex characters are defined via the exponential map, but the theory takes off when the 
ring C where the character takes values is a p-adic ring, and the exponential function does 
not have a large enough p-adic radius of convergence to be useful. We improve this radius of 
convergence by using a modified exponential function as follows. Let n be an element of the 
algebraic closure of Qp that satisfies tt^"^ — — p; then Zp[7r] = Zp[Cp] where (^p is a primitive 
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pth root of unity. We define tfie function 

0{t) = exp Lt + = cxp{7T{t - tP)) = e Qp[7r][[<]] 

^ ^ ^ i=0 

which is called a Dwork splitting function. It is sometimes denoted by 9i{t) to distinguish it 
from other splitting functions. We have 

ordpA, > (1.4) 

where ordp is the p-adic valuation normalized so that ordpp = 1; thus, in fact 9(t) G Zp[7r][[t]]. 
We observe that 9{1) = 1 + tt + 0{7t^) is a primitive pth root of unity, and so we obtain om- 
additive characters via the maps 

: ¥qr Zp[n] 
e,(a;) = 0(l)(T'-°T'v)(J) 

where Tr o Tr^ = Trp^^/F^ is the absolute trace. 

The values of the characters 8^ can be p-adically interpolated in a way consistent with field 
extensions, as follows. Let Qq be the unramified extension of Qp of degree a — logp q, and let 
^9 Q Qq denote its ring of integers, so that Zq is the Witt vectors over F^. Let a : Zq 'Lq 
denote the p-power Frobenius (lifting the pth power map on the residue field Fg.) There is a 
canonical character w, called the Teichmiiller character, of the multiplicative group F^ taking 
values in Zg that takes an element 3; G F^ to the element x (E Zq, satisfying x"^ = x and such 
that X reduces to x m¥q. For such a Teichmiiller representative a; G lifting x, we find that 

a-1 

ei(x) = Y[e{xP') ^ e{x)e{xP) ■ ■ ■ eix'i/p) eZpin]. 

i=0 

and extending this for r G Z>i we have 

ar—l 

Orix) = Y[ e{xP' ) G Zp[7r] 
1=0 

for X G WqT and x a Teichmiiller lift of x. Let a : Zq Zq by x i—i' x"' denote the p-power 
Frobenius, the ring automorphism of Zq that reduces to the map x t-^ x^ modulo p. Then 

ar — 1 

Qr{x) = n ^(^"')- (1-5) 

i=Q 

We now consider these character values applied to values of our Laurent polynomial /. 
Write f{x) = ^^a^x'' G Fg[a;^] in multi-index notation; we assume that each 7^ 0. Let 
fi^) = Uijx'^ G Zq[x^], where a^, is the Teichmiiller lift of a^. In light of (|1.5|) . we are led 
to consider the power series 

F{w,x) =Y[e{wa^x'') G Z,[7r][[w,x±]] 

and 

a-1 

where denotes the power series obtained by applying <j to the coefficients of F . (The 
abuse of notation which identifies a power series and its specializations will only occur in this 
paragraph.) It then follows from (|1.5p and a straightforward calculation that 

e^(uy7(x)) = i^f"^' (u), x)i^('^) («;«, a;9) • • • F^''^ («;«""', x"""' ) G Zp[7r] (1.6) 
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for all {w, x) € WqT x F^r", where {w, x) denotes the Teichmiiller lift. We have thereby extended 
the interpolation of the values of / to the power series (|1.6p . 

Dwork trace formula 

So far, we have related the zeta function to the L-function of an exponential sum via a p-adic 
additive character arising from the Dwork splitting function, and we have interpolated these 
character values in a power series F = F{w, x). In order to move this to cohomology, we define 
a space of p-adic analytic functions like F with similar support and p-adic growth. 

Let A — A(/) be the Newton polytope of /, the convex hull of its support 

supp(7) = {i^e Z" : ^ 0}. 
For d e M>o, let dA = {dz G R" : z e A} denote the dth dilation of A. Let La be the ring 

La = < ^ ^ Cd.^w"^ x"" : Cd.i, e Zqiir] a.nd ordp{cd,,,) > d^^-Y^\ . (1.7) 

The estimate (|1.4p implies that F £ La, and so multiplication by F defines a linear operator 
which we also denote F : La La- 

On the space La, we have a "left inverse of Frobenius" ip : La — > La defined by 

/a-i(cd,.)u;'*/fWP, ifp\dandp\iy, 
I 0, otherwise; 

in multi-index notation, the condition p \ v means p \ Vi for all i = 1, . . . , n. The map f/' is 
tT~^-semi-linear as a map of free Zg-modules. 

Finally, let a = o F and aa = ip"' o F^°'\ Then Ua is Z^-linear, and another calculation 
reveals in fact that aa = a" (composition a times). 

The Dwork trace formula [16j then asserts that 

Sriw7,&:^+')^{q'- -ir+' Trial). 

It follows [3J that 

n+l 

L{w7,G-J\T)(-'^" = n dct(l - iq=T)ai \ La)^-'^^"^'). (1.8) 

The equality (|1.8p expresses L{wf, GJJ+^, T) via the action of aa and its powers on an (infinite- 
dimensional) p-adic Banach space; this is the point of departure for Lauder and Wan in their 
work |47| . 

We now proceed one step further and move to the level of cohomology. 
Dwork cohomology 

We now consider a Koszul complex fl' associated to wf as follows. To ease notation in this 
subsection, let xq = w. For i = 0, . . . , n, let 

/.=..^ (1.9) 

OXi 

and define the operator Di : La — >■ La by 

d 

Di = Xi- h 7r/i 

OXi 
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(the latter is the operator given by multipUcation by Trfi). The operators Di commute. For 
k = 0, . . . ,n, let 



0<Ji<-<jk<n 



dx-i 



dx i 



Let be the complex 
with maps 



o^n° ^ > ^ 



— - A — A • • • A — - 



for ^ e La- 

Now a induces a map on the complex 



— 



^f^o 



ri+l 



^ 



(1.10) 



since one checks that aDi — pDia for all i. (One similarly has a map induced by tta, replacing 
P by q.) 

Then l3| we have 



n+l 



L(u;/,(G:^+\T)(-i)" - n det(l - | Win))^-'^"^ ~\ 

3=0 

The condition that / is nondegenerate simplifies the expression (|l.lip . as we now see. 



(1.11) 



Nondegenerate 

We recall our notation from the introduction. For a face t C A, let f\r — X^i/st Then 
we say / is nondegenerate if for all faces r C A (of any dimension, including A itself), the 
system of equations 



f\r = X- 



df\r 

dxi 



df\r „ 

Xn ^ = 

OXn 



has no solution in , where is an algebraic closure of Fg. 

Suppose / is nondegenerate. Then all the cohomology spaces H^{Q. ) are trivial except for 
k = n + \ . Let 

B = ^ ^ H^+\n'). 

Then by work of Adolphson and Sperber 1^, the Zg[7r]-module B is free of dimension 
n! vol(A) — V (equal to the normalized volume of the cone over A in R"+^) and 

L{wJ,Gl]+\T)<^-^^" = det(l - aaT \ B) e 1 + TZ[r] (1.12) 

(using ([13) ). 

Let A (resp. Ag) be a matrix of a (resp. Ua) acting on B. Then we have 



An = AA" ' ■■■A" '° " . 



(1.13) 



where A'^ denotes the matrix where a is applied to each entry in the matrix. 
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An overview of the algorithm 

We now describe how to effectively compute the terms in the formula (11.121) , and in particular 
the matrix A (I1.13p . We sketch an overview and wait to describe each of these steps and their 
running time in detail in the sections that follow. 

The algorithm takes as input a nondegenerate (confined) Laurent polynomial / and a 
precision N G Z>o, and it produces as output the polynomial det(l — aaT \ B) modulo . 
For N large, we recover the coefficients in Z and then from (|1.2I) we recover Z{f,T). 

Let R — 'Zq/p^ . (By carefully factoring out the algebraic element n, we may work in this 
smaller ring; see Lemma 12.121 and the accompanying discussion.) 

In (|1.7p we have worked with the power series ring La, but by the convergence behavior of 
elements of Xa, working modulo p^ these power series become polynomials. So we define 

oo 

R[wA] = ^ R[wA]d 

d=0 

where 



R[wA]d = Rw'^x". 



The ring R[wA] is the monoid algebra arising from the cone over A with coefficients in R, and 
it is naturally Z>o-graded by w. Recall (jl.9p that we have defined 

f 

Ji = WXi — 
OXi 

for i = 1, . . . , n (and /o = wf). 
Our algorithm has 4 steps. 

1. Compute the Teichmiiller lift / of /. Using linear algebra over R, compute a monomial 
basis V for the Jacobian ring 

J = R[wA]/{wf,wfu...,wU). 

(The monomial basis V for J yields a basis for B.) 

2. For each monomial m £ V, compute the action of the Frobenius a{m) using "fewnomial 
enumeration" . 

3. For each m d V, reduce a(m) in cohomology using the differential operators Di to an 
element in the i?-span of V. (The matrices implicitly computed in Step 1 are used in 
this reduction.) 

4. Compute the resulting matrix A — a \ B modulo p^ , then compute Aa using (jl.lSp and 
finally 

ziv, qT) = det(i - rA,)(-i)"z((G;;)F„T). 

Output Z{V,T). 



2. The algorithm 

We now describe each step of the algorithm announced in our main theorem and introduced 
in section 1. We retain the notation from section 1. 

Step 1: Computing the Jacobian ring 

We begin by describing the computation of a basis of the Jacobian ring 

R[wA] 



J 



{wf, Wfi,...,wfn)' 
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as result of this computation, we also obtain matrices which will be used in the reduction step 
in Step 3. 

Lemma 2.1. If f is nondegenerate, then J is a free R-module with basis of cardinaUty 
V = Vol(A) consisting of monomials with degree < n + 1. 

Proof The proof of Monsky '52' (see work of Adolphson and Sperber fS', Appendix]) shows 
that under the nondegeneracy hypothesis, the associated Koszul complex is acyclic modulo p 
and therefore lifts to an acyclic complex over R (as the modules are complete, separated, and 
flat over Zg). □ 

To compute with the ring i?[wA], we need some standard algorithms for computing with 
polytopes. For a set C Z", we denote by A(S') its convex hull. 

Lemma 2.2. There exists an efficient algorithm that, given a finite set 5' C Z", computes 
the set Z"n A(5). 

We discuss this algorithm and its running time in detail in the next section (Proposition 
13. 5p : it will be treated as a black box for now. 

Let ^ be a term order on the monomials in i?[wA] that respects the w-grading. We begin 
by computing in each degree d = 0, . . . , n + 2 the set of monomials in dA using Lemma 12.21 
and we order them by ^. Then, for each such d a spanning set for the degree d subspace of 
the Jacobian ideal, {wf, wfi, . . . , 'wfn)d, is given by the products of the monomials in (d — 1)A 
with the generators wf, wfi , . . . , wf„ of the Jacobian ideal. Finally, for each d, let Jd be the 
matrix whose columns are indexed by Z" H dA, i.e. the monomial basis for R[wA\d ordered by 
-<, and whose rows record the coefficients of the spanning set for {wf, wfi, . . . , wf)d- 

We then compute the row-echelon form Aid = T^Jd of Jd for d = 0, . . . ,n + 2 using linear 
algebra over R. According to Lemma 12.11 every pivot in Md can be taken to be a unit in R; 
thus, a monomial basis V for J is then obtained as IJ^ Vd where Vd is a choice of monomial basis 
for the cokernels of Md- In particular, the matrix Mn+2 has full rank and so has a maximal 
square submatrix with unit determinant. 

Step 2: Computing the action of Frobenius 

Recall we have defined the Dwork splitting function 

oo 

0{t) = exp(^(t - tP)) = ^i^' ^ 2pH[W] 

1=0 

with ordp A; > i{p — l)/j3^. The image of 6{t) modulo in i?[7r][[i]] is a polynomial of degree 
less than Np'^/{p - 1) = N(p + 1 + l/(p - 1)). We compute 6{t) = cxp(7ri) exp(-7riP) as the 
product of two polynomials of this degree. 

Remark 2.3. Here we have used that p is odd. For p = 2, the splitting function 9{t) 
above does not converge sufficiently fast for the algorithm described below to work. For the 
modifications necessary, see section 5. 
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We have / — o.ux'^ £ Zq[x^] with 7^ and v E and 

F{w,x) ^JJeia^wx"). (2.4) 

One option is to muhiply out the product (j2.4p naively. Instead, we seek to take advantage 
of the fewnomialness of /. We have 

F(w, a;) = n e(a^wx'') =[1(1 + Ai(a^wa;'') + • • • + X^ia^wx^^y +...). (2.5) 

Let supp(/) — {v\^ . . . , and abbreviate = a^. . Expanding (12. 5p out we obtain 

F(z«,a;)= ^ (Ae,---AeJ(ar (2.6) 

(ei,...,es)GZ|Q 

We make further abbreviations using multi- index notation as fohows: for e G Z?,q, which we 
abbreviate e > 0, we write Ae = Ae^ • • • Ae^ and — a^^ ■ ■ ■ a%' , and we write |e| = ei + • • • + Cs, 
and finally ev — e\V\ + • • • + tsVs for the dot product. Then (I2.6p becomes simply 

F(w,a;) =^Aea'=u;l'=la;"^ (2.7) 

e>0 

Let w'^x^ G V . Then from ([^77]) we have 

a(u;'^x'^) = (V' o F)(u;'^x'^) = ^ Aea-i(a")w(l'^'+'')/fx("'^+")/P £ Xa- (2.8) 

e>0 
p|(ei/+/i) 
Pl(|e|+d) 

The set of indices for the sum on the right side of (|2.8p is then contained in the set 

Ed,ii, = {e G Z>Q : e^i^i = — /i^ (mod p) for all i and |e| = — d (mod p)}. (2-9) 

When the set Ed^fi is proportionally fewnomial relative to the set of all monomials, we can 
hope to be able to enumerate it faster than multiplying out F completely. 

Remark 2.10. On the other hand, if / is dense, in general we gain no advantage using this 
approach, as can be seen by the following example. Let / be a generic univariate polynomial of 
degree s, so that n = 1 and supp(/) = {0, 1, . . . , s}, and let /i G Z>o. Then there is a bijection 
between -Ed./i and the set of all (integer) partitions of d = /i (mod p) into parts of size < s. 
Since the number of such partitions grows exponentially with d, enumerating all such partitions 
would be prohibitively time consuming. 

We can compute the set -Ed,/^ by considering the corresponding set of linear equations modulo 
p. Let U denote the (n + 1) x s matrix whose columns are the vectors (1, z^^)*, and let 

K = K{d, fi) ={e:Ue= (d, /x)* (mod p)} C (Z/pZy. 

We identify K with its image in ZS^q by taking the smallest nonnegative residue in each 
component. Then (|2.9p becomes simply 

Ed^^ = K + {pZ>oy . 

Let p be the rank of U modulo p. Then — p'^^'' , since by our assumption dim(A) = n so 
s > n + 1. 
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Rewriting (j2.8p . with this notation we obtain 



pe\^{\k\+p\e\+d)/p^{(k+pe)u+fi)/p 



keK e>0 

(2.11) 



= J2 cT-i(a^-)«;(l^-|+'^)/^'x('='^+^)/P ^Afc+pea' 

keK \e>0 

Here we used the fact that o'^^((a'')^) = since a is a Teichmiiller element. We then compute 
a{w^x^) using the formula (|2.1ip . 

We make one final substitution, which will simplify the reduction: we carefully factor out 
the algebraic element tt (satisfying ttP'^ — —p) as follows. 

Lemma 2.12. We haveordpX, = i/{p- 1) G Q/Z. 

Proof. The ith coefficient of both exp(7rt) and exp(— Trt^) satisfy the congruence, and 
consequently the same is true of the product. □ 

Therefore, in the expansion 9{t) = ^ Xif we write Ai = TT^£i so that £i £ Qp and 

ord,(^.) > z - = (2.13) 

\ p-lj P^{p- 1) 

Although we introduce some denominators here, they are controlled. Extending our multi-index 
notation, we obtain 

aiw'^x^') = fT"'(a')(™)(l"l+'''/%("''+'')/P I ^7rl'=l+fl^l-l^l-(l'=l+'^)/f4+pea^(™)l^la;^'' 

keK \e>0 

Since tt^^^ = — p, we write 

|fc| + p\e\ - |e| - (|fc| + d)/p ={p- l)\e\ -d+{p- l){\k\ + d)/p. 

Thus 

(\k\+d)/p( ^{\k\+d)/p (kv+p)/p 



aiinwfx'') = ^ a-i(a'=)(-p)(l'=l+'*)/''(7ru;) 



x"- 

keK 

(2.14) 

Y,(-Pph+pea'{7rw)^^^x'''' 

(We introduce the power tt'' to simplify this expansion; we may then just divide out at the 
end.) 

Step 3: Reducing in cohomology 

Our goal in this step is to determine the characteristic polynomial of Frobenius a acting on 

H^+\n)^LA/EtoD^LA 

modulo p^ , where N is the desired precision. The preceding analysis, including Lemma 12.121 
and particularly the expression (j2.14p . shows that a acting on {-kwYx^ with v € dA is a series 
in terms of the form (ttwYx^ with /i G eA and coefficients in Zg. The p-adic behavior of 
Frobenius assures us that the coefficients of {ttwYx^ tend to in as |e| — >■ oo. Therefore, 
modulo p^ , the image a{{'Kw)'^x'^) is a polynomial G G i?[(7ru')A], a linear combination of 
terms {iiwYx^^ {fi G eA) with coefficients in Iqjp^ . Since the factor tt enters only formally, we 
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may suppress the tt factor and view the resultmg polynomial in i?[i(;A]. From Lemma \TT\ we 
work with coefficients in Zq/p^ , rather than using the iterative reduction method described 
by Adolphson and Sperber [3| Theorem 2.18]. 

Therefore, let G € R[{Trw)A]; we show how to reduce G in cohomology. Let lm(G) be the 
leading monomial (highest degree monomial) in G with respect to ^. First suppose that the 
degree of lm(G) (in w) is at least n + 2. Let mg be a monomial in Rn+2 that divides lm(G) 
(a precise choice will be given in the next section), and let m = lm(G)/mo. Let G^™-* consist 
of those terms in G in mR[wA]n+2 and identify ^ — G^-^^^ /m C R[wA]n+2 with a row vector 
indexed by the monomials of R[w A]n+2, each containing the term (nw)"^'^ by our convention. 

Let rj = ^T(i. Since Jn+2 is of full rank, we have 

Thus, with the natural identifications, we have written 

£, = ?7o(7ru;)/o + ?7i(7ru')/i H h rin{TTw)fn (2.15) 

with 770, . . . ,?7n £ i?[(7rw)A]„4.i = 7r"+^i?[wA], and therefore 

(TO?7o)(7rw)/o + • • • + imT]n){TTw)fn = = G^™'. 

Recall that we work in the module B — L/ ^^DiL, where Di — Xid/dxi + {'Kw)fi (and we 
again set w — x^ for convenience): this implies that in B, we have the relation 

Note now that all terms in the reduction (j2.16p have degree one smaller than that of G^"^K We 
then iterate this procedure on the leading monomial in G — G'™-* until it is equivalent in B to 
a polynomial of degree < n + I. 

So now assume that G has degree d < n + 1, and let Gd be the degree d terms of G. We 
repeat the above procedure, with £, = Gd- we compute rj = ^Td and let Vd — ^ — rj. Note that 
Vd is in the span of V by construction. Then by a similar calculation as in (|2.15p . we have 
i = Vd + J2i=o m{'^w)fi and so 

Gd = (,=Vd-^Xi e B. 
Completing the final iteration on decreasing d, we obtain G = X]d=o £ B written in the span 

of y. 

Remark 2.17. In this reduction theory, we never perform a division: we simply reduce the 
power of TTw, as we have written G as a polynomial in nw and x. Consequently, we do not lose 
precision in our analysis. 

Step 4: Output 

To conclude, we assemble the reductions of a{w'^x'^) for w'^x'^ e V into a square matrix 
A = a \ B oi size v x v, where v — #1^, with coefficients in R. We then compute 

Aa ^ AA"'' ■ ■ ■ A'''^''''\ 

We then compute det(l — TAa) using standard methods (analyzed in section 4), and we recover 
the zeta function from (|1.2p - (|1.3p : we compute 

z(v, qT) = det(i - rAj(-i)"z((G;^)F,, T) 

from which we easily obtain the desired output Z{V,T). 
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3. Some algorithms for polytopes 

In this section, we describe some methods for computing with polytopes which are used as 
subroutines. 

Confined polytopes 

By polytope we will always mean a lattice polytope. Let A C M" be a polytope with dim A = 
n and normalized volume v = Vol(A). (If dim A < n then by restricting to the linear space 
that contains A one can make appropriate modifications to the algorithms below.) The group 
GL„(Z) acts on M" preserving the set of polytopes; we say two polytopes A, A' are GL„(Z)- 
equivalent if there exists U S GL„(Z) such that U{A) — A'. 

By work of Lagarias and Ziegler |431 Theorem 2] , any polytope A C M" is GL„ (Z)-equivalent 
to a polytope contained in a lattice hypercube of side length at most nv. We now prove a slight 
extension of this result using their methods. 



Lemma 3.1. Any polytope A C R" is GL„(Z)-equivatent to a polytope contained in a 
lattice orthotope (box) with side lengths &i, . . . , 6„ satisfying &i • • • 6„ < 



Proof We follow the proof of Lagarias and Ziegler |43l Proof of Theorem 2] (working 
always with normalized volume); for convenience, we reproduce the main idea of their proof. 
First suppose that the polytope is a simplex S having vertices vq, . . . ,Vn € Z". Then the lattice 
A spanned by the basis vectors Wi = Vi — vq is a sublattice of Z" with det(A) = Vol(A) = v. 
There is a matrix U E GL„(Z) taking the matrix B whose column vectors are Wi to its Hermite 
normal form 

fan ... \ 



UB = 



021 022 



\Oti1 0„2 







7 



with < aji < an for j > i and an > for all i and aji = for j < i. Now det(A) = | det(i?)| = 
nr=i ~ ^ ^^'^ hence the parallelopiped generated by the row vectors of t/i? is contained in 



the orthotope 



S = {x e M" : < < a,;, for 1 < i < n} = [0, an] 



[0, o„ 



The simplex UT, is contained in this parallellopiped and hence also the translated orthotope 

Suppose A is now an arbitrary polytope. Then |431 Theorem 3] there exists a maximal 
volume simplex E C A, and moreover A is contained in the simplex —nT, + (n + l)z where 
z = Y^^=o centroid of E and (n + l)z G Z". By the above, E is GL„(Z)-equivalent to 

a simplex contained in an orthotope S and hence A is equivalent to a polytope contained in 
the orthotope —nE. + [n + l)z with side lengths 6i, . . . , 6„ with bi = nan so IliLi ~ 'ti'^v, as 
claimed. □ 



We say that A is confined if A is contained in an orthotope (box) with side lengths 6i, 
with 6i • • • 6„ < ri"w. We say that / is confined if A(/) is confined. 



Remark 3.2. The problem of finding a maximum volume simplex, the key to making the 
proof of Lemma 13.11 computationally effective, has been studied in detail (see, for example, 
work of Gritzmann, Klee, and Larman [26] ). To avoid going too far afield into the field of 
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computational geometry, in this article we accept any input Laurent polynomial / but only 
analyze the runtime when / is confined. 

Corollary 3.3. We have #(A n Z") < {2nYv. 

Proof. The action of GL„(Z) preserves lattice points, and so if S is the orthotope given by 
Lemma 13.11 then 

#(A n Z") < #(S n Z") = (5i + 1) • • • (6„ + 1) < (26i) • • • (26„) < 2"(n"i;) 
as claimed. □ 

Remark 3.4. Corollarv 13.31 is in some sense best possible, since equality holds for A = 
[0, 1] C K. For fixed n, one can do no better than #(A n Z") = 0{v) since for any polytope 
A we have #((iA n Z") ~ Vol((iA)/n! as d — >■ oo. For a given n, one can improve the constant 
(2n)" significantly working in a more general context (see Widmer [62]). 

Enumerating lattice points 

Our next task (arising in Step 1) is to exhibit an algorithm to enumerate the lattice points 
in the convex hull of a set of lattice points. 

Let S C Z" be a nonempty finite set. Let A — A{S) denote the convex hull of S. Let 
V = Vol(A) and s = #5'. Suppose dim(A) = n. Finally, let S — S{S) = max^^s W\ where \v\ = 
maxj Ivi]. Then the bit size of S is 0{s log 6). 

Proposition 3.5. Let n € Z>o. Then there exists an algorithm that, given a finite set 
S C Z", computes the set Z" nJ^{S) in time 0(s r"-/2l \ogS + vlogS). In particular, if A is 
confined then this runs in time 0(sr"/2l +v). 

Remark 3.6. If A is confined, and contained in a orthotope S, then A H Z" C S H Z" and 
the latter has cardinality 0{v), but still one needs to test if each such lattice point is contained 
in A. The exponential contribution of the first term comes from the combinatorics of A, which 
in general can be quite involved. 

The proof of this proposition combines several well-known results in computational geometry. 
We refer to the book of Preparata and Shamos |53| and the articles by Seidel |55| and Fortune 
[23] and the references contained therein for more detail. 

The first main step is to compute the Delaunay triangulation of S. We lift each point 

e 5 C Z" to (i^, e Z"+i where = lyf -\ + ly^. Then the convex hull of the lifted 

vertices has simplicial faces (under mild assumptions that can be achieved under a suitable 
perturbation), and projecting the simplicial faces onto the original vertices yields a triangu- 
lation. There are many (deterministic) algorithms to compute the Delaunay triangulation: we 
will invoke one method, called the incremental algorithm, with optimal deterministic variant 
due to Chazelle [12] . 

To give a brief outline of this algorithm, we follow the overview given by Seidel |55l 19.3.1]. 
The incremental algorithm orders the set S = {vi, . . . ,Vs\ and incrementally computes A; = 
A(5'i) from Ai_i, where Si = {vi, . . . ,Vi} . The description of A^ is by its facet description, the 
set of all facets specified by their defining linear inequalities. A facet of Ai_i is visible from 
i>i if its supporting hyperplane separates Ai_i and i>i, otherwise we say the facet is obscured. 
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Updating Ai_i to involves finding (and deleting) all facets visible to Vi, preserving all 
obscured facets, and adding new facets with vertex Vi. We record these steps keep track of the 
triangulation created in this way by updating the facet graph. 

From this description, it is clear that the integer operations involve only computing and 
checking linear inequalities defining facets arising from the convex hull of subsets of points of 
the form {v, \\v\\'^) of cardinality n. By Cramer's rule, the linear equalities defining such a facet 
have coefficients that are bounded in size by [n + l)\5^{n5'^) = 0(5"+^), so the largest integer 
operation can be performed in time 0{{n + 2) log 5) = 0(log i5) using fast integer multiplication 
techniques. (The bit arithmetic is also analyzed by Fortune [241 4.7].) 

The incremental algorithm requires 0(slogs + sL("+i)/2J) = ©(sT"/^!) integer operations (if 
n is fixed), so in fact the total number of bit operations is 0(sl^"/^^ log 5). 

Remark 3.7. Having computed the Delaunay triangulation, in fact we have computed 
a complete facet description of the convex hull A — A(S'); if desired, one can compute the 
complete face lattice (in particular, the set of all vertices) in the same running time ^55j . 

Having computed the Delaunay triangulation, we can list lattice points by doing so in each 
simplex. The problem of enumerating lattice points in an (integral) simplex is analyzed by 
Bruns and Koch j8j in the context of computing the integral closure of an affine semigroup. 
Let S C A be a simplex of the Delaunay triangulation of A defined by wq , . . . , and let 
Wi — Vi — vq. We compute the Hermitc normal form of the matrix B with columns Wi (which 
can be done using 0{\ogS) bit operations in fixed dimension n |31l I15|): let w'^ be its 
columns. We then enumerate all elements in the simplex, each represented by a lattice points 
in the fundamental parallelopiped Z"/(X)i^^Di using 0(Vol(I]) log 5) bit operations. Since 
the simplices E triangulate A, putting this together with the first main step, we have proven 
Proposition 13.51 



4. Precision and running time estimates 

In this section, we discuss precision and running time estimates for each of the four steps of 
our algorithm. We suppose throughout this section that / is confined. 

Step 1: Computing the Jacobian ring 

First some basics. A ring operation in i? = Zg/p^ can be performed using O(iVlogg) bit 
operations using standard fast multiplication techniques. The Teichmiiller lift of an element of 
¥g to R can be performed using 0(log N) Hensel lift iterations (Newton's method in this case 
reduces to the iteration of the map cc i— >■ cc*); each iteration can be performed in time 0{N log^ q) 
by repeated squaring, for a total of 0(A'^logiVlog^ g) = 0{N log"^ q) bit operations; the time 
to compute the lift / we will see is negligible. Similarly, the time to compute the partial 
deriviatives wfi = wdf /dxi is negligible. 

We now compute the monomial basis V for the Jacobian ring J, as described in Section 
2. By Proposition 13.51 we can compute Z" n dA for d = 0, . . . ,n + 2 in time 0(5^"/^! + v): 
because we have fixed the dimension, we have Vol(dA) = d" Vol(A) — 0{v) for such d. (Given 
the inequalities defining A, we could also simply compute Z" n (rt + 2)A and then find Z" n dA 
by testing the scaled inequalities.) 

We now analyze the computation of the row-echelon form = TdJd for d ~ 0, . . . ,n + 2. 
The matrix Jd has 0(d"v) columns indexed by Z" n dA and 0((n + l)(d - l)"u) = 0((d - 
l)"u) rows indexed by (wf,wfi^...^wfn)d- The row echelon form of an r x s-matrix can 
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be performed using 0{r^s) ring operations using standard techniques, so we can compute 
Md = TdJd in time 0{{d — ring operations and so for c? = 1, . . . , n + 2 using 

0{n^''v^N log q) = 0{v^N\ogq). 

bit operations. From this, we compute the basis V . 



Remark 4.1. In practice, it may be more efficient to use Grobner bases to compute the 
basis V and capture the effect of the reduction matrices Md- (In particular, the Faugere's F4- 
and i^s-algorithms would be quite useful.) For simplicity, we take the direct approach using 
linear algebra. 



Step 2: Computing the action of Frobenius 

First, we compute the image of the Dwork splitting function 

00 

0{t) = exp(7r(i - tP)) = irH.e 

i=0 

modulo . We have ordp(7r*£i) = ordp(Ai) > i(p — l)/p^, so modulo the image is 
polynomial of degree less than iVp^/ {p — 1) = 0{pN). Recall (j2.13p that 



ordp(^i) > -d{p,i) = - 



»(2P"1) 
p'^ip- 1) 



We write each £i to precision N as an element of the module p-'^^P'^^ (Zq/p^+''(P'*)) . The largest 
such denominator in our expansion is bounded by 

<i(p,».V(.-l))< (^)^^A'(f^ = OWp). («) 

Therefore we compute 9{t) with coefficients in Zg/p^^ where M — N{1 + {2p — l)/(p — 1)^) = 
0{N), then compute each £i — Xi/n^. 

To compute 9(t) modulo p^ , we multiply exp(7rt) with exp(— tt^^) truncated to degree 
Np^/ij}— 1). The latter factor exp(— Trt^) has < Np/{p— 1) terms, so multiplying the two 
can be done in time 0{{N^p^/{p- lf)iM\ogq)) = 0{pN^\ogq). 



Remark 4.3. Note this computation only depends on p and A'^ and does not depend on /. 
Given a monomial w'^x'^ e V, we compute the action of Frobenius a using (j2.14p : 
aiiTTVufx^') = J2 'T-\a'')i-pf'\+^^/P{7rwY\''\+'^'^/Px^^''+^''^/P I ^(-p)l^l4+pea'(™)'''x^" | 

keK \e>0 J 

This is computed to precision N, interpreted as above: to multiply £i times £j we add 
the exponent of the denominators in p and multiply the numerators as usual; since these 
denominators are bounded, the extra arithmetic with denominators is negligible and such a 
multiplication can be performed using 0{M \ogq) = 0{N \ogq) bit operations. 

Recall Remark 12.171 in the upcoming reduction step (Step 3), we lose no precision as no 
divisions occur: we only reduce the power of nw. Therefore, to have the Frobenius expansion 
correct to precision N, we just analyze the convergence to of the term 
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Using (|2.13p . we have 

Ordp(Cfc,e) = 0rdp((-p)l'=l + (l'=l+'^)/P4i+pei • • • 4.+peJ 

\k\+d I, 2p-l 

>^ + N-(W+^'N)^2(^ (4.4) 

>(|fc|/p + N) 

V P^~pJ P 

Here and from now on we insist that p > 2: then in (j4.4p . we have ordp(cfc_e) > 0. In 
particular, then, the coefficients are all integral (this was guaranteed by the reduction theory 
of Adolphson and Sperber [3]); so even though we have (temporarily) written the values £k+pe 
with denominators, the power of p multiplies through to make them integral. Also note for any 
basis element {tt'w)'^x^^ £ V, we have d < n + 1 = 0{1). Let 



(3 = Pip) - 1 - 



2p—l\ ^ P^ — P 



p^ ~ P J p^ — 3p + I 

For p = 3 we have /3(p) — 6, but for p > 5 we have (3{p) < 20/11, and (3{p) — 1 as p — ?> oo. In 
(|2.14[) , we have multiplied through by tt'^ for uniformity in the expression, so we must compute 
to this extra precision: so let 

n + I d d d 

7 = 7b>" = — > — = 7 - -■ 

p'' — p p'^ — p p — 1 p 

Note that ^{pjn) < n. 

Then (|4.4p implies ordp(cfc^e/7r'^) > |e|//3 — 7, since > 0. Thus to have the answer correct 
to precision N we only need to worry about terms with |e| < (3{N + 7) = The set of e G ZS>q 
with |e| < E has cardinality OiE" /si) = OiE"); since — p^~'^ , the number of terms in the 
expansion of a{{'Kw)'^ x^) is 0{p'^^''E^). 

We compute the terms in the sum indexed by K. We can compute — ■ ■ ■ 0^ = 
using 0(s(log |fc|)(iVlogg)) = 0(sA^log^ q) bit operations, since \k\ < ps; cr~^(a'^) = (a'^)''/^ by 
repeated squaring is computed using 0{N log^ q) bit operations. The exponent arithmetic in the 
power of X is negligible up to logarithmic factors. Therefore, we can compute the p^^P 
terms in the sum indexed by K in time 0(p'^~''sA^log^ q). 

Now we compute the terms in the inner sum. The value £k+pe — ^ki+pei ■ ■ ■ ^ks+pcs can 
be computed using 0{s) multiplications or 0{sN\ogq) bit operations. We compute the 
values a'^ recursively, so that each term (ordered by ^, increasing |e|) requires only one 
additional multiplication. Therefore the 0{E^) terms in the inner sum can be computed in 
time O [E' [sN log q)). 

Putting these together, the total product can be computed using 

6{p''^PsN\og^q + E''{sN\ogq)+p'-PE^N\ogq)) 

= dip'-P sE' N \og^ q) = Oip'-pp'iN + -()'N\og^ q). 
bit operations for one monomial in V . 

Step 3: Reducing in cohomology 

We now reduce the elements G — a{{TTw)'^ x^) G L computed in Step 2. We begin by reducing 
the degree of G, as explained in Section 2, using multiplication by the matrix T = Tn+2, until 
this degree is < n + 1; then we complete the reduction by multiplications by the matrices Td 
with d = n + 1, n, . . . , 1, 0. The number of such reductions in any fixed degree d > n + 2 is 
governed by the number of translates of (n + 2)A that cover dA n Z". By definition, we have 

dAnz"== IJ (1^ + ((n + 2)AnZ")) 

i/e(c!-(ri+2))AnZ" 
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so the number of reductions is at most - (n + 2))A n Z") = 0(((i - (n + 2))"^;) by 

Corollary 13. 31 

Remark 4.6. The fewest number of translates that one could hope for is 
Vol(dA)/Vol((n + 2)A) = (d/(n + 2))". 
But it may be that the combinatorics of A will not allow this. 

Each reduction involves multiplication of a vector by a square matrix of size 0{{n + 2)"w) = 
0{v) over i?, which can be achieved in time 0{v'^N\ogq), so reduction from degree d > n + 2 
to c? — 1 takes time 0((c? — (n + 2))"i;'^iV log (7) ~ 0{d"v^N log q). Similarly, reduction from 
degree d<n + 2toc?— 1 takes time 0{d^v^N log q). Repeating this for d = . . . , 1, gives a 
total time of 0{E'^+^v^N\ogq) = 0(/3"+i(iV + -^Y+^v'^NXogq) to complete the reduction. 

Step 4: Output 

Having assembled the square matrix A of size v, we compute the product 

Aa ^ AA"'' ■ ■ ■ A"'^"''^ 

(where a = log^q). It takes time 0{v^N\og'^ q) to compute applied to a matrix of size v, 
and time 0{v^ N logq) to multiply two such matrices, for a total time of 0(log(7(ti^iVlog^ q + 
v'^Nlogq)) = 0(v'^ N log^ q) to compute Aa- The characteristic polynomial of a matrix of size 
ti can be computed using 0{v^ N \ogq) ring operations, which is absorbed into the previous 
estimate. 

Total running time 

We now add up the contributions from each step, proving Theorem A. 

Step 1 takes time 0(sr"/2l J^v^Nlog^ q). Step 2 takes time 0{pN^\ogq + p'^'P [i^iN + 
-lYvN \og^ q). Step 3 takes time 0(/3"+i(A^ + -fY+^v'^Nlog^ q). Step 4 takes time 0{v^N\ogq). 
Since s > n + 1 as dim(A) = n, the time in Step 2 dominates, up to a polynomial in vNlogq. 
This totals to 

0(sr"/2l + pN^ log q + p^-Pp'iN + -fYiv'^N log^ q)) 
bit operations. Using the estimate /? < 6 and 7 < (n + l)/20, this becomes 

0(sr»/2l + pN^ log q + p''- P (eN + ny{v'^Nlog^ q)). (4.7) 
and if p > 5 then /3<20/ll<2so this improves to 

0(sr»/2l + pN^ log q + p''-P{2N + nYiv'^N log^ q)) . 

Precision 

We have thus computed the characteristic polynomial to precision N . To prove Theorem B, 
we estimate the value of N required to recover the zeta function itself. First, we factor this 
characteristic polynomial so as to work with p~^a instead of a. The matrix A has block form 

] where we order the monomials in the basis V for the space B by degree: the only term 

in degree is the monomial 1. The matrix Aa also has this form, since it holds for each term 
in the product. Therefore the characteristic polynomial of a on B factors as (1 — T) times the 
characteristic polynomial on quotient space Bq = B / R. The action of the Frobenius a on Bq 
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is divisible by p; let Aq be the matrix of a on Bq. Then 

z(F,r) = det(i-(p-i)Aor)(-i)" {■ 



T) 



(1-T) 



T=T/q, 



Therefore, we may compute with p~^a instead of a. 

The characteristic polynomial det(l — q~^{Ao)aT) then has inverse roots of absolute value 
at most q"/^ by a theorem of Adolphson and Sperber [3] and Denef and Loeser |14) . Therefore 

its ith coefficient is bounded by ("jg*^"/^-*. 



Lemma 4.8. For all x e M>i and v e Z>o, we Tiave 



V 



\v/2-]+j 



max \x — [ . ]x 

o<i<v \i J \\v/2] +J 

where < j < [w/2j is the unique index such that 



K2j-(j-l)- lv/2\-j 



The proof is an easy inductive argument. In the two extremes: ii x > v then j = lv/2\ and 
the largest coefficient is x^; ii x < ([w/2] + l)/[t;/2j (equal to 1 + 2/v if u is even, for example) 
then the largest coefficient is {^J/2i)^^"^^^- 1^ follows that the p-adic precision N required to 
recover all of these coefficients as integers from their reduction modulo is given by 

p''>^{^,/^^^^)i^''^')^'^''^' (4.9) 

where j is given as in Lemma [4.81 with x — g"/^. 

In practice, one will want to work with the precision estimate (j4.9p . To estimate the runtime, 
we have the crude bound 



which impHes we may take 

TIV 

N>{v + l) logp 2 + — logp q = Oinv log q). (4.10) 



Remark 4.11. We are forced to take a larger bound than just the middle coefficient because 
we do not have a Riemann hypothesis in this generality. For many varieties under consideration, 
such a hypothesis will give a better estimate on the precision, since the higher coefficients are 
determined by the lower ones. 

Also, work of Kedlaya |36j shows how to recover the zeta function often in practice with 
much less precision knowing only that it can be factored as a product of Weil g-polynomials. 

Plugging the estimate (|4.10p into (|4.7p . and considering s (and n) to be fixed, we obtain the 
estimate 

dip{v log qf log q + p'^Piv log qYv^v log q) log' q) = 0(p™(l.^-p)„«+5 log*+3 q) 
for the number of bit operations performed. This completes the proof of Theorem B. 
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5. Modifications 

In this section, we discuss some extensions and modifications to the above algorithm. 
Dense input 

We can also modify the algorithm for the situation of dense input. One can also forget the 
condition of sparsity and analyze the running time on dense input. Here, we do not use the 
expansion (|2.14p . but rather directly compute the product (|2.4p . The analysis above shows that 
the computation of the expansion 

ArpV(p-l) 
1=0 

can be performed in 0{spN^ log?) operations. We have s such terms in the product (12.41) . This 
product has monomial support in {Np'^/{p— 1))A so has 0{{pN)^v) terms. 

As in (14. 2p . we compute in Xq/p^ , and multiplying two polynomials in n variables with 
coefficients in i? = 'Lqjp^ with at most 0{{pN)^v) terms takes time 0{M{pN)^"'v'^ log q) if 
multiplied term- by-term. (See also Lauder and Wan |47l Lemma 30].) (The grading by w 
allows us to multiply more carefully, but this saves only a constant factor for fixed dimension.) 
Therefore, the product F{w,x) can be computed in time 

d{spN^ log q + sM{pN f'^v'^ log q) = d{sp^'' N^'^+^v^ log q). 

Therefore, one sees a benefit from the fewnomial expansion only when s < 2ri + 1. 

The time to compute a(w'^x^) requires negligible time in comparison, as it involves only 
exponent arithmetic and applying the inverse Frobenius a^^ . The other steps are unmodified, 
so the total time is 

0(sr»/2l +sp2»iv2"+ii>2logg-H (67V + n)"+ii>3iVlogg)) = 0(sr»/2l + sp^^iY^^+i^^ log 9) 

to compute the zeta function modulo p^ (the analogue for Theorem A), and plugging in the 
estimate (j4.10p for N and considering s fixed we obtain 

0(p2"t;2»+4log2"+2g) 

(for Theorem B). 

Modifications wlien p ^ 2 

For many applications (notably in coding theory), the computation of zeta functions and 
L-functions in characteristic 2 are particularly important. While it may be possible to perform 
this analysis, we do not do so in the present work. Instead, we mention some of the hurdles this 
analysis faces in characteristic 2 using the approach we have taken applying Dwork cohomology. 
(Some of these same hurdles, and others, also arise in other p-adic approaches.) 

Dwork's original p-adic study of zeta functions concerned a nonsingular projective hypersur- 
face V C P" ^ over defined by the vanishing of a homogeneous form f{x) G Vq[xi, ■ ■ ■ , Xn 1 of 
degree d. When gcd(2,p, d) = 1, Dwork constructs ap-adic cohomology space with an action of 
Frobenius such that the characteristic polynomial of Frobenius acting on cohomology gives the 
important nontrivial middle dimensional primitive factor of Z{V,T). In particular, if p = 2, 
Dwork's cohomology does not at present apply to smooth hypersurfaces of even degree in 
characteristic 2. 

In some cases, Adolphson and Sperber [2] are able to supplement Dwork's work when p — 
2 I d. For example, if p | n and we consider the Dwork family of hypersurfaces 

f{xi , . . . ,Xn) — ~^ X2 + ■ ■ ■ + X^ — Xxi, . . . ,Xn 
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in characteristic p = 2, then even though the family consists of singular hypersurfaces, they 
are nondegenerate with respect to the sublattice of generated by the support of wf{x). 

As a consequence (even when p — 2), there is a cohomology space such that the characteristic 
polynomial of Frobenius acting on this space is the nontrivial factor of the zeta function. But 
computing in the sublattice adds to the computational complexity. 

A second obstacle concerns convergence and our choice of splitting function 

ei{t) =exp{nt- {nt)P/p) 

where ttP~^ — p, which converges for ordp< > — (p — This led us to the space La = 

La{{p~^)/p^) consisting of power series with similar growth) and the operators Di — 
Xiid / dxi) + i:Xiw{df) / (dxi) acting on La- Our explicit reduction theory in the Jacobian 
ring depended on the operator norm of Xi{d/dxi) being p-adically smaller than the operator 
norm of TTXiw{df)/{dxi), so that the series (I2.14[) converges after reduction. This requires that 
l/{p — 1) < (p — l)/p, i.e. p < {p— 1)^, and this fails if and only if p = 2. 

Therefore, when p = 2, it is necessary to use a splitting function with better convergence 
properties. Let tt £ Q2 be a nonzero root of the equation 

7r^/8 + 7r''/4 + 7rV2 + TT = 0, i.e., tt^ + 27r^ + 47r + 8 = 

and let 

e,it) = exp (.t + + + M!) = e M-W]]- 

In this case we have ord2 > (ll/16)i. Working with the space La(11/16), reduction is 
possible (in La(11/8)) since 11/8 > l/(p - 1) = 1 when p = 2. 

The computational difficulties caused by using 6*3 (t) instead of 9i{t) are numerous. Not 
only is the reduction algorithm more difficult, but even the calculation of Frobenius is more 
complicated ^ Proposition 3.2]. 

Remark 5.1. Robba constructed a constant called TTRobba by Dwork with ordp TTRobba — 
l/(p — 1) with splitting function 0Robba(i) = exp(7rRobba(i — i^)) which has a better radius of 
convergence than Dwork's 9i{t). However, even with this improvement, the p-adic norm in case 
p = 2 of TTnohha.Xi'w{df)/{dxi) does not dominate the norm of Xi{d/dxi) on the appropriate 
Banach space. 

For these reasons, we do not deal at present with the case p = 2. 
AfEne varieties 

Next, we describe modifications to the algorithm to compute the zeta function of affine 
varieties. (At the price of some additional notation, one could consider the more general case 
where the variety is a combination of affine and toric.) 

Let f{x) = civx" e Fg[a;i, . . . , x„] be a polynomial. For a subset AC [n] = {1, . . . , n}, we 
denote f a{x) £Vq[xi : i € [n] \ A] the polynomial obtained from / obtained by setting = 
for all i & A. We say that / is convenient (with respect to the variables xi, . . . , x„) provided 

dimA(7^)=dimA(7)-#v4 

for all subsets AC [n]. Equivalently, / is convenient if and only if / has a nonzero constant 
term and a monomial xf' with di G Z>o for all i = 1, . . . , n. The notion of convenient is also 
called commode. 
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We suppose for the rest of this subsection that / is convenient and nondegenerate (with 
respect to A(/), defined as before). It is a consequence of the hypothesis of convenience that 
dim A(/) — n. 



Example 5.2. Suppose / has total degree d and A(/) is the convex hull of the set of points 
{0} U {dei}" where ei, . . . , e„ is the standard basis in M". Then we may write / = f^'^^ + g 
where /'^'^ is the form of / of highest degree terms and the total degree of g is less than d. 
Then / is nondegenerate if and only if ■* = defines a nonsingular projective hypersurface 
in P"-#^-i and = is nonsingular in A"^^"^ for all subsets AC [n]. 



Assuming then that / is nondegenerate and convenient, we can work more simply with the 
affine L-function L{wf,Gm x A",T), as follows. Let V denote the affine hypersurface in AJJ 
defined by / = 0. We modify the calculation in Section 1 by working with an affine exponential 
sum: 

SO 

- L{w7,G„,xA^,T) 

The computation of L{wf,Gjn x A",T) is for the most part quite similar to the toric 
calculation given earlier. We describe here the required modifications. Let ACS' = {l,...,n}. 

(A) 

Let denote the ideal in the ring La consisting of series having support in monomials 

divisible by xa — Yii^A^i- Under our hypotheses, the complex D, for the i-function has 
vanishing cohomology H^(fl ) for all i except i = n + 1 and 

rS 

H''+\n') = ^ -J- (5.3) 

where the Di are as in section 1. Note that here i?"+^(r2 ) is contained in the cohomology 
space La/ (E"=o DiL^). 

The required modifications are then simple. For example, the Jacobian ring has the form 



The reduction algorithm then only requires identification of monomials in the ideals L£ and 
L^^^^^ , and the recursive reduction process preserves the required divisibilty on monomials so 
the algorithm runs in every other way without modification. 

In this way, one can reduce the sizes of computations involved under the convenient 
hypothesis: one avoids calculation of the contributions to the zeta function coming from the 
intersection of the affine hypersurface with the coordinate hyperplanes. 



Projective varieties 

We now consider the modifications for projective varieties. 

Suppose that / G Fg[xi, . . . , a;„] is a homogeneous form of degree d with gcd(p, d) = 1 and 
that / = defines a nonsingular projective hypersurface Z C P^~^. Suppose further that = 
defines a nonsingular projective hypersurface in P"-#'4-i foj- all subsets A C [n]; such a 
hypersurface is said to be in general position. If gcd(p, d) — 1, then / defines a projective 
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hypersurface in general position if and only if / is nondegenerate (with respect to A(/)) and 
convenient (with respect to the variables xi, . . . ,Xn)- 

Here, the support of wf lies in the hyperplane — so dim A(w/) = n 

(or, equivalently, dim A(/) — n — 1). 

It is well-known that 

- p(r)(-i)""' 

^^^'^^ " {1 - T){1 - qT) ■ ■ ■ {1 - q^-^T) 

where P{T) is a polynomial of degree d~^[{d — 1)" + (— l)"(d — 1)) that represents the action 
of Frobenius on middle-dimensional primitive cohomology. By work of Adolphson and Sperber 
[3], the cohomology of the complex fl for L{wf,Gm x A",T) described above in this case 
has vanishing cohomology H^{n ) —0 for i n,n + 1 and that there is an isomorphism of 
Frobenius modules i/"+^(r2 ) = iJ"(ri ) with Frobenius on being q times the Frobenius 
on As a consequence, 

det(l - FrobT | H''+\n)) = P{qT) 

yielding the "interesting" part of the zeta function of Z. 

As in the affine case, the space 7?"+^ is isomorphic to the quotient defined in (j5.3|) . But we 
can simplify further. By the Euler relation, we have dDo = on. La so that 

n 

1=1 

This enables us to reduce the calculation by suppressing the role played by w entirely. To 
determine the monomials x'^ in La, we simply need to check 

x'' e Ma = {iye Z|o : \u\ - E^,;/, = (mod d)} 

and use w^'^^^'^x'^ . So the power of w enters only formally. 



Let 



and write 



L>i = Xi- 1- nxi——j 

oxi axi 




La = S ^ ayx" : eZg [tt] , ordp > [ ^ ^2 ) f ^ ^9 1'^l I W, 
Our object of interest for the reduction is then 



B" 



■'A 



v^n TS\{i} 



^A 

The preceding algorithms for reduction may be applied here as well; the appropriate powers of 
w may be formally inserted as necessary. 

Finally, as we remarked in the comments for the case p = 2, when p \ d there are further 
modifications that can be made by considering polynomials that are nondegenerate relative to 
a sublattice even in some exceptional singular cases [2]. 

Exponential sums 

In Section 1, we reduce the problem of computing the zeta function to the problem of 
computing the L-function of an exponential sum. But in many situations the problem of 
computing this L-function itself is of interest. In this case, there is no auxiliary or dummy 
variable w; the support of the p-adic power series in our space La consists of those lattice 
points in the cone over Aoo(/) which itself is the convex closure of the support of / together 
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with the origin. We note the earher work of Lauder and Wan 1481 149] who apply some similar 
approach to compute the L- function of a one-dimensional exponential sum (i.e., n = 1). 

We consider the case of toric exponential sums. (Considering affine exponential sums, when 
convenient, requires modifications similar to those above.) Let f{x) = ^j^'SuX'^ £ Fg[a;^] be a 
Laurent polynomial. We say / is quasihomogeneous if there are rational numbers ai, . . . ,a„ 
such that w{i') = 1 for all i/ E supp(/) where 

n 
i=l 

We restrict to the case of quasihomogeneous exponential sums as this conforms quite closely 
with our computation of zeta functions; the method could be adapted to the general case. 

There are fewnomial examples of quasihomogeneous exponential sums that indeed appear 
nontrivial. For example, consider a subset A — {i^*-^-*, . . . , C Z" of cardinality n + 1 with 

each element z/*^*) S A lying on the hyperplane H defined by aiXi = 1 for all i = 1, . . . , n + 1. 
When the convex hull A of A is not an n-simplex, then the L-function associated with 

n+l 

X 



fix) = 



on G"^ is not well-understood. Even when A is an n-simplex, such sums are not entirely 
understood. We know from Adolphson and Sperber [3] that if / is nondegenerate with respect 
to Aoo(/), then the complex D, (/,(G"j) for this L-function is acyclic except in dimension n 
and 

Now i(/, GJJj, r)(^^^ ^ G ZfCplir]. This is one complication. The second complication is 
that the reduction has a modification: now we bring things down by weight. This is analogous 
to the degree of w, and the argument is formally the same. The ring is still graded by the 
weight. 

If all vertices lie in a hyperplane (quasi-iiomogeneous), then the Frobenius matrix has the 
property that the elements belong to Tr'^Z^: the n and the weight move as one unit. (So we can 
do multiplications in the smaller ring.) Then the filtered ring is a graded ring. The only time 
when one has to do honest calculations in is in the final calculation of the characteristic 
polynomial. 



Twisted exponential sums on the torus 

Let f{x)E¥q[x^] be nondegenerate. Let xi,...,Xn be multiplicative characters of ¥q 
(possibly including the trivial character). Since the character group of is generated by 
the Teichmiiller character w, each Xi may be identified with an integral power of uj with 
< tti < g - 1. It is useful to write = t^"' = where 7i = a^/{q - 1) e [0, 1). The 

shifted lattice A (7) = (71, . . . , 7„) + Z" plays an important role in the cohomological study of 
the twisted sums 

5(7, 7, G:;)- X uj''^{x,)---u''"{xn)e(f{x,,...,x^)) 

and the associated L-function L{j,f,T). 

We modify now our earlier work on quasihomogeneous nondegenerate toric sums to include 
the case of twisted sums. For € Z" we define w{i') to be the smallest rn G Q>o such that 
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p — 1 

where Mj{f) is the intersection of the cone over / intersected with A(7). 

As before, let / denote the Teichmiiller hft of /. The Frobenius map a = ipqO exp(7r/(x) — 
/(x^)) acts on La and the Dwork trace formula for L(7, /, T) takes the form 

i(7j,r)(-i)"^^=det(l-Ta|4^)r 

where g{TY — g{t)/g{qT) for g <E Cp[[T]]. As usual, we define differential operators 

9 df 

Di = X^- h TTXi—— 

axi oxi 

on La, and construct a complex f2* using La as the base and boundary operator as before. 
The Frobenius defines a chain map on this complex using a, and the hypothesis that / is 
nondegenerate implies that = for i ^ n and 

L(7,7,r)(-i)"^' = det(l - aT \ 

where 

is a free Zp[C(5-i)j,]-algebra of finite rank Vol(A). Our calculation of the matrix of Frobenius 
acting on H^{n') follows the same argument used above in the case of ("untwisted") quasi- 
homogeneous nondegenerate toric sums. 

Multiplicative character sums on the torus 

Continuing with this line of analysis, let x = — w'''"^^^^-' be a nontrivial multiplicative 
character of F^. We extend x to all of ¥q by setting x(0) = 0. As in the previous section, let 
/ € ¥q[x] be nondegenerate, and consider the character sum 

and its associated L-function L(7o, /, T). 

We now use an elementary character argument to transform such a multiplicative character 
sum to a twisted exponential sum of the type considered in the previous section. Suppose that 
X is nontrivial (i.e., 70 0). Then for m £ we have 

J2 x-'imtu) = -iG{x-\e))x{u) 

where 

Gix-\Q) = -Y. X-\vMv) 

is the negative of a Gauss sum. Since x is nontrivial, this identity holds for u = as well. 
Therefore our sum of interest 

5(70, 7, G™) = -Gil-jo,er'S{j,w7,Gl]+') 

where 7 = (1 — 70, 0, . . . , 0) G and the exponential sum on the right is a twisted sum of 

the type in the preceding section. 
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By the Hasse-Davenport relation on Gauss sums, we have 

L(7oJ, G'^, Gix-\ e)r)-i = L(7, wj, <G,-J\T). 

Note wf is always quasihomogeneous, as the monomials all lie in the hyperplane in W^'^^ with 
equation w = 1. If / is nondegenerate, then we may proceed to compute the L- function as in 
the previous section on twisted sums. 



6. Examples 



Elliptic curve point counting 



In this subsection, we give an example to show how our methods can be used to compute the 
zeta function of an elliptic curve. In this situation, our method is not competitive with more 
specialized methods, but running through the algorithm in this case will hopefully shed some 
insight on how it works. 

Let p > 3 be prime. Let f — + ax + b — y"^ &Wq[x, y] be such that 4a'' + 276 ^ 0, so that 
/ = defines an affine piece of an elliptic curve E over Fg. (This does not cover all cases 
when p = 3; we leave the other examples to work out by the interested reader.) Let / = 
x^ + ax + b ~ y'^ be the Teichmiiller lift of / to Zq[a:, y]. 

If 6 = 0, then E has complex multiplication by Z[i] and is a twist of the elliptic curve 
y^ = x^ — X, so the zeta function can easily be recovered by classical methods; the same is 
true if a = 0. So we assume that ah ^ 0. Therefore, the polytope A is the triangle given by 
the convex hull A({(0, 0), (3, 0), (0, 2)}). One can check that / is automatically nondegenerate 
given the nonvanishing of the discriminant 4a'' + 275^ ^ 0. (See also work of Castryck and the 
second author [TTl .) Furthermore / is convenient (with respect to x,y). 

In this situation, we have 



where 

Z(E,T) 



L{wf,<G„,xk',T) = P{qT) 

P{T) _ l-agT + qT^ 



(l-T)(l-gT) (l-r)(l-gT) 
and Cg = g + 1 - #E{¥q) has \ag \ < 2y/q. 



Remark 6.1. Here, we can see the advantage of working with the affine curve rather than 
the toric curve. First and foremost, the computations are performed in a cohoniology space 
of dimension degi(/, T) = 2, as opposed to one of dimension degL*(/, T) = Vol(A) — 6. This 
difference is accounted for by the number of points on E along the coordinate axes, as follows. 
From the relation 

Z(E,qT) = L*{wf,T)Z{Gl,qT) = L*{wf,T)- ~ 



(l-T)(l-g2T)' 

after some cancellation we find that 

L*iwf,T) = (1 - T)P(qT)P.,{qT)Py{qT) 

where Z (E f] {x = 0} ,T) = P^{T)/{1 - T) is a polynomial of degree 1 which is 1 - T or 1 + T 
depending on if 6 £ is a square or not; similarly, Py{T) is a polynomial of degree 2 which 
depends on the factorization of + ax + 6 in Fg. 
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We have the graded ring 

ZJ?«A] =^Z,{w''x'y' : G dA) C Zg[[w,x,y]] 



d 

and work in the Jacobian ring 

J 



{wf,Wf^,Wfy) 



where 



r 3 

Jx = X— = 3x + ax 
ox 



fy = ~2y'. 

The afhne Koszul complex associated to / (see Section 5) is 



^ La A La ® © l'^^ A L^^^ ® L^^^ ® L^^^^ A L^^^^ 



where L^'' C La the subspace divisible by the monomial m, and D ~ {D^, D^^ Dy) where 

d d d 

^w—+TTwf, = x—+TTwf^, Dy^y— + Trwfy. 

We work in the cohomology space 

B = - 



We then consider the free Z^-module 



In weight 1 we have Ji = Zqwxy, since this is the only monomial divisible by xy and it is 
nonzero in the quotient. 
We have 



J2 = w 



2 span({xy, x'^y, x^y, x^y, xy^, x^y"^, x^y^,xy^}) 
spa.n{{xyf, yfx,xyfx,y^fx, xfy, x^fy, x^fy,xyfy}) 



2 span({l,a;,a;2^a;3|) 

= w xy -— ^- , , ^ „ 5-^^- YT ^^q-w xy. 

span(|a;-^ + ax + 0, + a, csx'' + ax]) 

So the monomial basis V we take is wxy^w'^xy. 

Since |ag| < 2y^, we recover Uq miiquely as the integer a such that aq = a (mod p^) and 
|a| < 2^ with > 4yg, so > log^ 4 + (1/2) log^ g or iV = 0(log9). 

For V = wxy, w^xy, we expand as in (j2.1ip . We first take v — wxy. Then we have 

K = {e: Me = -(1, 1, 1)* (mod p)} 

where 

M = 

We compute then that 

K = {-(1/4,1/4, 1/2,0) + t(l, -3, 0,2) : t G Z/pZ). 
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Note that for k = (fci, ^2, k^, fc4) E K wc have simply k^ — (p — l)/2. Thus 

a{wxy) = J2 fT~^(a)''H-l)''-V"^(&)''''(-p)*''''+^^/^(7rw)(l'^l+i)/Px(=''=i+'''^)/^'y 
feeif 

(6.2) 

J2i^P) ' (- 1 {TTWp X^^' y2e3 

We now reduce the series (I6.2p . Our algorithm to do this would proceed by degree; to make 
the resulting matrices digestible, we alter the method slightly. We reduce first with respect to 
the monomial , or "vertically" . We have 

Dyiiv'^x'^y''-'^) = {v- 2)«;Vy"-2 + TTwfy{w'^x''y''-^) =OeB, 

and wfy = —2wy^ so 

Tnu'^x^y" = ^iv- 2)w'^-^x'^y'"-'^ G B. 
By induction, if v is odd we have 

(™)L'^/2Jx-y" ^ ^ ((^; - 2)(« - 4) . . . 3 • 1) x^y = ^2] !(-!) L^/^J {^^j^^x^y. 
Thus in the series (|6.2p we have 

e>0 e>0 ^ \ ^3 J 

We now reduce with respect to x^ , or "horizontally" , then finally reducing with respect to 
the origin. We compute the matrix Pj.3 such that 



/ w^x^y\ 






w^x^y 






yw^x'^y J 







and having coefficients in 'Lq{w x y, w x y, w x y). Some linear algebra then yields 

22 / 3ax(2aa; - 35) ~2a^x'^ + 'ib{ax + ih) -\ax(2ax ~ ibf 

P^3 = ^ -ax{9bx + 2a2) 3abx^ + (20^ + 9b^)x + 2a^b ^ax{9bx + 2a^) 

4a +276 y^a^^(^2ax^3b) {2a^ + 9b^)x^ ~ ab^ax + Sb) ^a^x{2ax - 3b) ^ 

Now, if 
then 

. d d d ^ 

This will allow us to write 

(w'^x'^y\ /w'^x'^yy 

w'^x^y = D^3{d,u)w'^~'^x^~^ w'^x^y 

w^x^yj \w^x^yj 

for a matrix D^sid, u) with coefficients in Zg[(i, u] which are linear in d, u. 
The rows of the matrix are obtained as follows. For « = 1, 2, 3, we have 

[wx'^){'U?x'-^'^y) = Piwfw +Pixfx +Piyfy 

where {piw^Pix^Piy) is the ith row of P^s. Thus 

W^X-^ = {w'''^X^~''){wX^){w''x'+^y) = {W^-^X^-^) {p^^Jn,+P^xfx+P^yfy) 
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and hence 

-TTw^x'^iw^x'+^y) = w-^iw^-^x^-^p,^) + x^iw^-^^-^'p,,) + y^iw^-^x^-^p.y) 

= {d + l)w'^~^x''-^pi.^ + w'^-'^ {{u - 3)x"-3p„ + x^'-^pL) + w'^-^x^'-^pr 
= w'^^^t""^ {{d + + {u- 3)|5„ + xp',^ +p,y) . 

From this we obtain that (4a'^ + 27b^)Dx3 {d, u) is equal to 

-3a&2(u-l) ia25(6d- 2it + 3) -2a3d + (2a3 + + (a^ + 

2a^b{u-l) -2a^d+ {2a^ + 9b'^)u-a^ -|a6(6d - 2u + 1) 

-|a6(6d-2u-3) a^{6d-2u+l). 

In a completely analogous manner, we obtain the matrix Di satisfying 

(w'^x'^y\ I w'^x'^y^ 

w^x^y = Di{d,u)w'^~^x'^ w'^x^y 
w^x'^yj yw'^x'^yj 

and indeed we have b{4a^ + 27b^)Di is equal to 

^(4a3 + 2762)d- (4a3 + 962)u- i(12a3 + -3a6(6d - 2u ~ 3) 2a^ {6d - 2u - 5)'^ 

Aa%{u + 2) " |62(6d-2u-3) 3ab{6d-2u-5) 

-6ab^{u + 2) aHi6d-2u-3) ^b^6d-2u- 5)^ 

Note the additional need to invert b for the matrix Di. 

To reduce the series (j6.2p ~ (l6.3p . we expand and rewrite it as 

a{wxy) = ^ w'^x^y ((cio + cnx + Ci2x'^) + wx^{ca + c^^x + a^x^) + . . . ) 

i>0 

= ^ w'^x^y ^ a;^-'(cj,3j+2 + Ci,3j+3a; + Ci^3j+4a;^) 
i>0 \j>o 

with Cij e Zg. We have 

ordp(c,,,0 > (i+ L//3J)^ 

so up to precision p^ we need only take i < p/{p — 1)N and 

i < J - - z. 

Each sum in (|6.4p is reduced using the matrix D^-a : if we abbreviate 

Cj(j) = I C,,3j+3 



(6.4) 



and write 



we have 



Dx^iiJ) = Dx3{i,0)Dxa{i + 1,3) • ■ ■ D^^lj + z,3j) 

j-i J 

ly'a;^?/ ^ CywLj/^Jj.j ^ w'x'^y^ D^sii, j)ci{j). 

For what it is worth, one can check that this algorithm runs in time 0(plog^ q). 
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Fermat-like hypersurfaces 



In this subsection, we show how the method works in the simplest case where one has a 
Fermat-hke affine hypersurface defined by 

J = aix'^^ +--- + a„<" + 6e F,[a;i,...,a;„] = ¥q[x\ 

where rrii € Z>o and ai • • -cinb 7^ 0. These were the varieties considered by Weil |61j in his 
seminal article on zeta functions. Koblitz |39) has studied these and shown that the number 
of points is given by Jacobi sums, which can be expressed by Gauss sums; see Wan [60] for an 
explicit algorithm which uses this method. In an early related work, Delsarte [13] studied the 
number of zeros of a polynomial 



9 = J2^i 



Mi) 



c G IFg [xi , . . . , X^l 



i=i 



in ¥q and its finite extensions, where i^lj) G Z"q; he described an explicit formula for this 
number in terms of Jacobi sums. 

The polynomial / is nondegenerate if and only if p | mi • • • m„, which we now assume. 

Let / = aix"' H h a„a;"" + b e Zq[x] be the Teichmiiller hft of / to Zq[a;]. We then have 

the polytope 

A = A(/) = A({(mi, 0, . . . , 0), . . . , (0, . . . , 0, m„), (0, . . . , 0)}) 

with normalized volume Vol(A) = n! vol(A) — nii ■ ■ ■ urin. 
Here, the affine complex gives the coliomology space 



B 



where 



for i = 1, . . . , n, and 



d d 

Dw = W- h 7r/„, Di = Xi- h TTfxi 

aw oxi 



fw = wf, fx, ^ {waimi)x'^' 



also for i — 1, . . . ,n. 

A basis V for the space B is computed as follows. In weight 1, we have Vi consisting of 
lattice points in A not on a coordinate face, 

Vi = {wxf" : {l/m)^l = J2^^^^/m, <1, ^l> 0}. 

In a similar way, in degree d, we obtain 

Vd = {w'^a;^ : i - 1 < (1/m)^ < i, ^ > 0, ^ < m}. 

In particular, we see visibly that Vd = {0} for d> n + 1. 
Now we consider each series expansion (|2.14p . We have 



K = {e : Ue = fi (mod p)} 



where 



U : 



nil 



\0 



1\ 
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which is an invertible {n + 1) x {n + 1) matrix, so K consists of the single element 



For simplicity of notation, we consider the indices of the columns to be 0, . . . , n. 
The reduction steps are similarly simple. Since fi = airriiwx^' , we have 









to/ . . 


• \ 















k = M-\-fi} = 















U 











whenever fj,i > mi. Similarly, from the equality 

1 
b 



" o(toi • • • m„) 



which implies 



{ttwY 



d- 1 



Inductively packing these up, in the terms of the expansion 

\e=(eo,...,e„)>0 

e>0 

and letting k = {kv + = (^{kiirii + fii)/p^._^ ^ we have 

n 
i=l 

where 

(z; q)r = {z- q){z - 2q) ■ ■ ■ {z ~ rq). 
If we let Too = 1 and kq = (|fc| + then we can abbreviate 

n 

(cTO + k; m)e = ]^(eiTOi + k^; mi)ei 
and substituting back into the sum, we obtain 

a{{TTwfx'') = (7ru;)(l'=l+'^)/fx« ( a-i(a'=)(-p)(l'^l+'')/P Epl"l4+pe(eTO + k; to)^ 



In this way, we have written down the reduced value in one stroke. 



Gabber hypersurfaces 

There are few works (if any) in the existing literature which give the computation of the 
zeta function of a projective hypersurface defined over of degree d with p \ d. Here we study 
the zeta function of such hypersurfaces for a particular family going back (according to oral 
communication from Nicholas Katz) to Ofer Gabber (see for example [331 11.4.6]. 
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Let d be a positive integer with p \ d and let 

n 

f{x) = aix'( + ^ UiXi^ixf^^ e ¥q[xi, ...,Xn] (6.5) 

1=2 

with aia2 • • • a„ 7^ 0. Let Zq be the projective hypersurface in Pp"^ defined by / = 0. The 
hypersurface Zo is easily seen to be nonsingular. Note that its defining equation (|6.5p is 
fewnomial. 

We consider somewhat more general hypersurfaces as follows. Let c = (ci, . . . , c„) be an n- 
tuple of positive integers such that |c| = J27=i '^i ~ ^- Suppose further that c is an interior point 
in A(/). We consider here the family of hypersurfaces Zx defined by 

Since Zq is nonsingular and the condition of nonsingularity is open, there is a closed subset 
W ^ A} , defined by the vanishing of a polynomial in [A] , such that Z\ is singular if and only 
if A e M^. 

If A ^ 14^, then we recall 

Z{Zx,T)- > 



{l-T){l-qT)---{l-r~^T) 
with the degree of Pa equal to ^-^(((i- 1)" + 1)). 

Example 6.6. An example of this situation is provided by the family of elliptic curves 

Z\: x\+ Xix\ + 2:22:3 + Aa:i2:22:3 

in characteristic 3. Here the singular locus is given by (5(A) = A"' — 1 = 0, the fourth roots of 
1. We note that the j-invariant of Z\ is equal to j{Z\) ~ A^^/(A'' — 1). 



It will be useful to consider the exponential sums (and their associated L-functions) for 
wfxix) as {w, (xi, . . . ,Xn-i),Xn) runs over various spaces which are products of tori with 
afhne spaces. For example, we recall from the sections on affine and projective hypersurfaces 
in Section 5 that 

LiwJ„G,r^ X A"-i X A\r)(-i)""^ = (6.7) 
It is our intention to show that (|6.7I) has the form 

pxm n"=o^.(r) 

where Rr(T) is a polynomial or reciprocal polynomial in Z[r] for r = 0, . . . , n — 1 which we give 
exphcitly. But t_hen n"=o ^r(T) = PxiqT) so that this calculation of Rr{T) for < r < n - 1 
in fact gives Z(ZA,r)- 

Since fx{x) is homogeneous, we have dim A(/;^) = n~ 1. For A ^ W, the nonsingularity of 
Zx implies that wfx is nondegenerate with respect to its maximal face (not containing 0). It 
is easy to see it is also nondgenerate with respect to its other (lower-dimensional) faces not 
containing 0. 

We cannot apply directly our results from the section on projective hypersurfaces in Section 
5 because p divides the degree d and moreover wfx is not convenient with respect to the 
variables xi, . . . , x„. It is, however, convenient with respect to x„. We proceed by partitioning 
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Gm X A" ^ X and particularly its middle factor as follows: 

n-l 

A"-i = Gl'-^ U y f/, (6.8) 

where Ur — GJ^T^ ^ {0} ^ A"^^^''. For notational convenience, we will sometimes write Uq — 
G^~^. Note that the terms Ur for r — 0, . . . ,n — 1 are pairwise disjoint, so that if we set 

Ur ^ G-m X Ur X A^ then 

n-l 

L{w7^,G„^ X A"-i X A\r) = [] L{w7^,U:.,T). 

i=0 

By work of Adolphson and Sperber [3], the complex Q' {wf^^, Uq) for L{wf ^, Uq, T) is acyclic 
except in dimensions n and n + 1 and 



RoiqT) 
where 

i?o(r) = dct(l - FrobT I H^+\n{w7^, U^)). 

We see easily, from the relation of Ro{T) and the zeta function for the variety defined by the 
vanishing of Ja in GJ^^ x A^ that Ro{T) e Z[T]. The calculation oi Rq(T) follows easily from 
the suppression of the w terms and the isomorphism 

Consider now for 1 < r < n ~ 1 the L-function associated with the exponential sum 
S{wJ^,U^). Write 

r-l 
i=l 

and 

n 

i=r+2 

Note that substituting = in f \{x) gives 

7A(^)L,.o=7i'''(^)+ffA'^(^)- (6-9) 
Note also that despite the notation, none of the polynomials in (|6.9p (for r > 1) depend on A. 
Thus 

S{wJx,U'r)^ X! ^{wl'^\\xi,---,Xr^l))[ ^ Q{wg''l\xr+l,.-.,Xn)) 

For any G Gm, the inner sum is easily seen to be q^~^^'^^ i.e. 

S{wl^,U'r)^q^^-^-'S{wf^\Gl,) 

and 

Consider the complex ^' {wf^l\G''~^^) for the L-function L{wf''^\G'^,T). It is acyclic 
H^{n ) = except for i = r — 1 and i = r and if we let 

i/('-)(T) =det(l-Frobr | H"- {Q {wf^\, G';j)) 
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then 



L(u;/r,G;„,r)(-i) 



Thus if we let 



Rr{T) = det(l - q"-'-iFrobr | {n {w , G'm))) 



(-1) 



then as with Rq{T), we have i?r(r) e1[T]. 
We can calculate Rr{T) using 



where we have suppresed the factor of w. Note that in the case r = 1 




1, we have 




so that Ri{T) = (1 - Similarly, if r = 2, then 




so R2{T) = (1 - 
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